ABSTRACT


The evolving discipline of computational pain research provides modern
tools for understanding pain. This discipline uses computational processing of
complex pain-related data and relies on "intelligent" machine learning
algorithms. By mining information from complex pain-related data and
generating knowledge from it, the facts can be simplified. Machine
learning therefore has the potential to greatly support the study and treatment of
pain. Indeed, the application of machine learning to non-imaging problems in
pain research has been reported in publications in
scientific journals from 1940 to 2018. Among machine learning methods, only a subset
has so far been applied to pain research-related problems, with SVMs, regression
models, deep learning and several kinds of neural networks appearing most often
in the pain literature. Machine learning is receiving increasing general
interest and appears to be penetrating many parts of everyday life and the natural
sciences. This trend is likely to spread to pain research. The current review
aims to familiarize pain professionals with the methods and current
applications of machine learning in pain research, possibly facilitating
awareness of the methods in current and future projects.

KEYWORDS: Machine Learning, SVM, Deep Learning, Neural Network,
Application of Machine Learning.


I. Introduction 

In the last decade, the amount of biomedical data gathered by
the bioinformatics and neuroscience communities has grown
exponentially. This large amount of data, which comes in different
forms such as genomes, gene expression data, and other gene-related
data in bioinformatics, or many modalities of structural and
functional imaging in neuroscience, has led to the need for
efficient and effective computational tools to analyze and
interpret these data.

In neuroscience, neuroimaging techniques, such as
computerized tomography (CT), positron emission
tomography (PET), functional magnetic resonance imaging
(fMRI), and diffusion tensor imaging (DTI), are used to study
brains in vivo and to understand the inner workings of the
nervous system. One of the main research goals of this field is
to analyze the human brain network, often referred to as the
human connectome, in order to understand its anatomical
and functional organization. Gaining such an understanding is
fundamental to supporting early diagnosis of neurological
disorders and to improving treatments of these pathologies.
Indeed, connectome data analysis has led to the discovery of
biomarkers associated with several neurological conditions.
Neuroimaging data are complex and high dimensional and
come in a wide range of spatial and temporal resolutions; for
these reasons, advanced analysis techniques are necessary to
describe data derived from each imaging method.

Machine learning (ML) deals with the analysis and
development of models and algorithms able to learn from
data in order to perform predictive analysis. Here, learning is
intended as the capability of ML models to adapt their 
configurations to optimally explain the observed data. 
Usually, the objective is to extract knowledge from a data 
collection in order to make predictions about new, previously 
unseen, patterns. 

II. Machine Learning 

In this brief, we attempt to introduce a few basic
techniques that are widely applicable and then show how
these can be used in various medical data imaging settings
using examples from our past work in this field. For further
information, interested researchers should refer to well-known
introductions to machine learning, such as the excellent
treatments with modern technologies in the field of
Pharmaceutical Research (PR).

Artificial Intelligence (AI), the study of the intelligence of
machines, is shorthand for any task a computer can
accomplish just as well as, or better than, a human being. But
there are different forms of computer intelligence to consider
when thinking about its role in pharmaceutical research.
Most of the computer-based solutions now emerging in
healthcare do not rely on independent computer intelligence.
Rather, they use human-created algorithms as the basis for
analyzing data and recommending treatments for patients. By
contrast, "machine learning" relies on neural networks.
Such applications involve multilevel probabilistic analysis
(MPA), allowing computers to mimic and even expand on the
way the human mind processes data. As a result, not even the
computer scientists can be sure how their computer programs
will derive solutions. There is yet another AI variant,
known as "deep learning," wherein software learns to
recognize patterns in distinct layers. In healthcare, this
mechanism is becoming increasingly useful. Because each
neural-network layer operates both independently and in
concert, separating aspects such as color, size and shape
before integrating the outcomes, these newer
visual tools hold the promise of transforming diagnostic
medicine and can even search for brain cancer or other types
of cancer at the individual cell level. AI
can be sliced and diced many different ways, but the best
way to understand its potential use in healthcare is to break
down its applications into three separate categories:
algorithmic solutions, visual tools and medical practice
in PR.

III. Machine Learning And Biomedicine Sub-fields 

Machine Learning, a branch of AI, is well established
as an effective tool for investigators, for solving
diagnostic and prognostic problems in a variety of
medical domains. ML is being used for the analysis of the
importance of clinical parameters and their combinations for
prognosis, for example prediction of disease progression,
extraction of medical knowledge for outcome
research, therapy planning and support, and
overall patient management. ML is also being
used for data analysis, such as detection
of regularities in the data by appropriately dealing with imperfect
data, interpretation of continuous data used in the Intensive
Care Unit, and intelligent alarming resulting in
effective and efficient monitoring. It is argued that the
successful implementation of ML approaches can help
the integration of computer-based systems in the healthcare
environment, providing opportunities to facilitate and enhance
the work of medical experts and ultimately to improve
the efficiency and quality of medical care. Over
the past few years, there have been a number of publications
of research results that claim that Deep
Learning (DL) has been applied to, and often
immediately outperforms clinicians in, a particular
area of diagnosis. One of the industries currently being
transformed by ML is healthcare and, in
this review paper, we have summarized and assembled
important applications of machine learning in healthcare
that are currently either in use or under development.

The One Hundred Year Study on Artificial Intelligence and
machine learning is a long-term study of AI/ML and its
impact on individuals and society. The shallow neural
network (SNN) dates back to the 1940s. The eight relevant areas
that are considered most salient include healthcare (the other
seven being transportation, service robots, education,
low-resource communities, public safety and security, employment
and workplace, and entertainment). Already used in areas such as
radiology, pathology, genomic medicine, cardiology,
outpatient services, and intensive care, AI will continue to
have an escalating impact in medicine, although with associated
fear among stakeholders of an AI "takeover", especially
of their jobs. Generally there are three main types of
ML subfields: supervised, unsupervised and reinforcement
machine learning, but many more subfields of ML are
described in the literature. Figure 1 shows the major ML
methods in biomedicine sub-fields.

PubMed Central® (PMC) [68] is a free full-text
archive of biomedical and life sciences journal
literature at the U.S. National Institutes of Health's National
Library of Medicine. A PubMed search for reports on
artificial intelligence and medicine yielded 11,751 results, and an almost
equal number, 12,792, in surgery since 1950. Interestingly,
a search in public health resulted in 35,098 publications. Of
note, some reports are focused on robotics
under surgery and surgical subspecialties. AI-related reports
in various subspecialties since 1950 are separated into "high",
"medium", and "low" use groups. The subspecialties range
from almost 10,001 reports in Oncology, which is not surprising
given its focus on genomic sequencing data and drug
discovery, to about 6,000 in medical image-intensive fields
such as Pathology and Radiology, and more than 1,000
articles in data-intensive fields (such as Epidemiology and
Critical Care Medicine). Interestingly, Neurology and
Neurosurgery are on this list of top subspecialties that have
an AI presence, as there is currently much emphasis on neuroscience
in artificial intelligence. Finally, most subspecialties,
even those in the low and medium use groups, have seen an
increase of at least 59% in the number of published reports from
2005 to 2016. Of note, there are an incredibly low 27 AI-related
publications in Neonatology, which is astonishing
especially since it is a data-focused specialty in a critical care



Figure 1: Major Machine Learning Methods 


Supervised Learning (SL).

SL, as the name indicates, involves a supervisor acting as an instructor. Supervised learning
is learning in which the machine is trained using well-labeled data, that
is, data already tagged with the correct answer.

Unsupervised Learning (USL).

USL is the training of a machine using data that is neither classified nor labeled,
allowing the algorithm to act on that data without supervision.

Semi-Supervised Learning (SSL).

Problems in which a large amount of data is available but only some of it is
labeled are called SSL problems.

Reinforcement Learning (RL).

RL is an important type of machine learning in which an agent learns how to behave in an
environment by performing actions and observing the outcomes.

Evolutionary Learning (EL).

Biological evolution can be seen as a learning process: living organisms adapt to improve their
survival rates and their chance of having offspring in their environment. This can be
modeled in a computer using the idea of fitness, which corresponds to a score for how good the
current solution is.

Multitask Learning Technique (MLT).

MLT has the simple aim of helping other learners to perform efficiently. When a multitask learning
algorithm is applied to a task, it remembers the procedure by which it solved the problem or
how it reached a particular conclusion.

Ensemble Learning (EL).

When various individual learners are combined to form a single learner, that
type of learning is called EL.
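
As an illustration of combining individual learners into one, the following is a minimal sketch of majority voting. The three threshold rules in `learners` are hypothetical stand-ins for trained models, and majority voting is only one of several possible combination schemes.

```python
from collections import Counter

def majority_vote(learners, x):
    """Combine the predictions of several learners by taking the most common vote."""
    votes = [learner(x) for learner in learners]
    return Counter(votes).most_common(1)[0][0]

# Hypothetical weak learners: each one is a simple threshold rule.
learners = [
    lambda x: x > 2,
    lambda x: x > 4,
    lambda x: x > 6,
]

print(majority_vote(learners, 5))  # two of the three rules vote True
print(majority_vote(learners, 3))  # two of the three rules vote False
```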

Active Learning (AL).

AL (also known as "query learning" or "optimal experimental design") is a subfield of machine
learning and, more generally, AI. The key hypothesis is that if the learning algorithm is allowed to
choose the data from which it learns (to be "curious," if you will), it will perform better with less
training.

Multi-label Learning (MLL).

MLL studies the problem where each example is represented by a single instance while 
associated with a set of labels simultaneously. 

Transfer Learning (TL).

TL has been proposed to allow the domains, tasks, and distributions to be different; it
extracts knowledge from one or more source tasks and applies that knowledge to a target task. The
benefit of transfer learning is that it can intelligently apply knowledge learned beforehand to
solve new problems faster.

Transductive Inference (TI).

As in the semi-supervised scenario, the learner receives a labeled training sample along with a
set of unlabeled test points. However, the objective of TI is to predict labels only for these
particular test points. As in the semi-supervised setting, the assumptions under which
better performance can be achieved in this setting are research questions that have not been
fully resolved.

Genetic Algorithms and Genetic Programming (GA & GP).

The genetic approach to ML is a relatively new concept. Both Genetic Algorithms (GA) and
Genetic Programming (GP) are forms of evolutionary computing (EC), which is a collective name
for problem-solving techniques based on the principles of biological evolution, such as natural
selection. Genetic algorithms use a vocabulary borrowed from natural genetics in that they talk
about genes (or bits), chromosomes (individuals or bit strings), and population (of individuals)
[1]. The GA approach is centered on three main processes: crossover, mutation and selection of
individuals. Genetic algorithms are based on the Darwinian principle of "survival of the fittest":
depending on the fitness function, the best possible solutions are selected from the pool of
individuals.
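
The three processes can be sketched on a toy problem. The example below is an illustrative assumption rather than a method from the cited work: chromosomes are bit strings, the fitness function simply counts 1-bits (the "OneMax" problem), selection is a two-way tournament, and the best individual is carried over unchanged (elitism). All function names and parameter values are hypothetical.

```python
import random

def fitness(bits):
    # Fitness score: number of 1-bits in the chromosome (toy "OneMax" problem).
    return sum(bits)

def select(population, rng):
    # Tournament selection: the fitter of two randomly drawn individuals wins.
    a, b = rng.sample(population, 2)
    return a if fitness(a) >= fitness(b) else b

def crossover(p1, p2, rng):
    # Single-point crossover of two parent chromosomes.
    point = rng.randrange(1, len(p1))
    return p1[:point] + p2[point:]

def mutate(bits, rate, rng):
    # Flip each gene independently with probability `rate`.
    return [1 - b if rng.random() < rate else b for b in bits]

def evolve(n_bits=20, pop_size=30, generations=60, rate=0.02, seed=0):
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    for _ in range(generations):
        elite = max(population, key=fitness)  # elitism: keep the best individual
        children = [
            mutate(crossover(select(population, rng), select(population, rng), rng), rate, rng)
            for _ in range(pop_size - 1)
        ]
        population = [elite] + children
    return max(population, key=fitness)

best = evolve()
print(fitness(best), best)
```

Because the best individual is always retained, the best fitness in the population never decreases from one generation to the next.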

Rule Induction (RI). 

RI is another very important ML method, and it is easier to use because the rules in rule induction are
more transparent and easier to interpret than a regression model or a trained neural network. This
model employs condition-action rules, decision trees, or similar knowledge structures. Here the
performance element sorts instances down the branches of the decision tree or finds the first
rule whose conditions match the instance, typically using an all-or-none match process [2].
Information about classes or predictions is stored in the action sides of the rules or the leaves of 
the tree. 

Dictionary Learning Technique (DLT).

DLT, often called sparse coding, is a basic building block of many machine learning
systems. This algorithmic primitive arises in applications such as denoising, edge
detection, super-resolution and compression. DLT is also used in the design of some DL
architectures. Popular approaches to solving this problem in practice are variants of the standard
alternating minimization approach.

Representation Learning (RL).

RL aims to ensure that a reasonably sized learned representation can capture a huge number of
possible input configurations, which can greatly facilitate improvements in both computational
efficiency and statistical efficiency. In order to give impetus to the multi-domain learning ability
of representation learning, automatic representation learning, biased representation learning,
cross-domain representation learning, and some other related techniques have been proposed in
recent years. The rapid increase in technical activity on representation learning has been
accompanied and sustained by a remarkable string of empirical successes in real-world
applications, such as speech recognition, natural language processing (NLP), and intelligent
vehicle systems.

Distributed and Parallel Learning (DAPL).

DAPL appears to be a promising research direction, since distributing the learning process among several
workstations is a natural way of scaling up learning algorithms. Unlike the traditional
learning context, in which the data must be collected in a database for central
processing, in the framework of distributed learning the learning is carried out in a distributed
manner. In past years, several popular distributed machine learning algorithms have been
proposed, including decision rules, stacked generalization, meta-learning, and distributed
boosting. With the advantage of distributed computing for managing big volumes of data,
distributed learning avoids the necessity of gathering data into a single workstation for central
processing, saving time and energy [3].

Knowledge Transfer Learning (KTL).

According to Soumava Roy et al. [4], techniques that benefit from available knowledge to
work around a difficult learning problem (e.g., learning tasks with insufficient data for training)
are commonly referred to as knowledge transfer methods. In this part, we first discuss the
problem of Zero- and One-Shot Learning (ZSL/OSL) from a deep learning viewpoint. As the names
imply, the number of training samples per class is zero for ZSL and merely one for OSL.

Finally, we examine Teacher-Student Networks (TSNs) [5] that distil the knowledge of a large
model into a smaller one.

A. Zero- and One-Shot Learning (ZSL/OSL).

B. Zero-Shot Learning (ZSL).

C. One-Shot Learning (OSL).

D. Teacher-Student Networks Learning (TSNL).

E. Few-Shot Learning (FSL).

Batch Learning (BL).

In BL, the system is incapable of learning incrementally: it must be trained using all the available
data. This is called offline learning. If you want a BL system to know about new data (such as a
new type of spam), you need to train a new version of the system from scratch on the full dataset
(not just the new data, but also the old data), then stop the old system and replace it with the
new one.

Online Learning (OL).

In OL, we train the system incrementally by feeding it data instances sequentially, either 
individually or by small groups called mini-batches. OL is great for systems that receive data as a 
continuous flow (e.g., stock prices) and need to adapt to change rapidly or autonomously. 
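
The incremental update can be sketched as follows. This is a minimal illustration under stated assumptions: a one-parameter linear model y = w * x trained by stochastic gradient descent on squared error, with a hypothetical in-memory `stream` of mini-batches standing in for a live data feed.

```python
def online_fit(stream, lr=0.1):
    """Update a one-parameter linear model y = w * x one instance at a time."""
    w = 0.0
    for batch in stream:          # each batch is a small group of (x, y) pairs
        for x, y in batch:
            error = y - w * x     # prediction error on the newest instance
            w += lr * error * x   # gradient step; older data is never revisited
    return w

# Hypothetical stream of mini-batches generated by the rule y = 2 * x.
stream = [[(1.0, 2.0), (2.0, 4.0)], [(1.5, 3.0)], [(0.5, 1.0), (1.0, 2.0)]] * 20
w = online_fit(stream)
print(round(w, 4))
```

In contrast to batch learning, the model here can keep adapting as new mini-batches arrive, without retraining from scratch.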

Decision tree based algorithms (DTBA).

DTBA define models that are iteratively or recursively constructed based on the data provided. 
The goal of Decision tree based algorithms is to predict the value of a target variable given a set 
of input variables. Decisions fork in tree structures until a prediction decision is made for a given 
record. 
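
The forking of decisions can be illustrated with a hand-built tree. In this sketch the tree, its feature names (`age`, `bp`), thresholds and risk labels are all hypothetical; a real DTBA would construct such a tree from data rather than by hand.

```python
def predict(tree, record):
    """Follow the branches of a decision tree until a leaf (the prediction) is reached."""
    while isinstance(tree, dict):
        branch = "left" if record[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree

# Hypothetical tree: internal nodes are dicts, leaves are class labels.
tree = {
    "feature": "age", "threshold": 50,
    "left": "low risk",
    "right": {
        "feature": "bp", "threshold": 140,
        "left": "medium risk",
        "right": "high risk",
    },
}

print(predict(tree, {"age": 63, "bp": 150}))
```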

Bayesian method based algorithms (BMBA).

BMBA are those that explicitly apply Bayes' theorem and again solve
classification and regression problems. Bayesian methods facilitate subjective probability in
modeling.
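
A worked application of Bayes' theorem to a binary diagnostic test may clarify the idea. The prevalence, sensitivity and specificity below are hypothetical numbers chosen only for illustration.

```python
def posterior(prior, sensitivity, specificity):
    """P(disease | positive test) computed with Bayes' theorem."""
    true_positive = sensitivity * prior
    false_positive = (1.0 - specificity) * (1.0 - prior)
    return true_positive / (true_positive + false_positive)

# Hypothetical values: 1% prevalence, 90% sensitivity, 95% specificity.
p = posterior(prior=0.01, sensitivity=0.90, specificity=0.95)
print(round(p, 3))
```

The prior encodes the belief held before the test result is seen, which is where subjective probability enters the model.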

Kernel method based algorithms (KMBA).

When we hear about kernel methods, the first thing that comes to mind is Support Vector
Machines (SVM). These methods are usually a group of methods in themselves. Kernel methods
are concerned with pattern analysis and, as explained in the preceding sections, the crux
of pattern analysis includes various mapping techniques.

Clustering Methods (CM).

Clustering, like regression, describes a class of problems and a class of methods. Clustering 
methods are typically organized by the modeling approaches such as centroid-based and 
hierarchical. 
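
As an example of the centroid-based approach, the following is a minimal one-dimensional k-means sketch. The data points and initial centroids are hypothetical, and the fixed iteration count is a simplification of the usual convergence check.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Centroid-based clustering: alternate between assignment and centroid update."""
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            # Assign each point to its nearest centroid.
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Move each centroid to the mean of its cluster (unchanged if the cluster is empty).
        centroids = [sum(c) / len(c) if c else centroids[i] for i, c in enumerate(clusters)]
    return centroids, clusters

points = [1.0, 1.2, 0.8, 9.0, 9.5, 10.5]
centroids, clusters = kmeans_1d(points, centroids=[0.0, 5.0])
print(centroids)
```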

Artificial Neural Networks (ANN).

Similar to kernel methods, ANN are again a class of pattern matching techniques, but
these models are inspired by the structure of biological neural networks. These methods are
again used to solve classification and regression problems. They relate to DL modeling and have
many subfields of algorithms that help solve specific problems in context.

Dimensionality Reduction (DR).

Like clustering methods, DR methods work iteratively and on the data structure in an 
unsupervised manner. Given the dataset and the dimensions, more dimensions would mean 
more work in the Machine learning implementation. The idea is to iteratively reduce the 
dimensions and bring more relevant dimensions forward. 

Instance based learning algorithms (IBLA).

Instances are nothing but subsets of datasets, and IBLA work on an identified instance or groups
of instances that are critical to the problem. The results across instances are compared, which 
can include an instance of new data as well. This comparison uses a particular similarity measure 
to find the best match and predict. Instance based methods are also called case-based or 
memory-based learning. Here the focus is on the representation of the instances and similarity 
measures for comparison between instances. 
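
A minimal sketch of this idea is the k-nearest-neighbour rule shown below, which assumes Euclidean distance as the similarity measure. The stored instances and their labels are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Predict the label of `query` from its k most similar stored instances."""
    # Similarity measure (an assumption here): Euclidean distance.
    neighbours = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    return Counter(label for _, label in neighbours).most_common(1)[0][0]

# Hypothetical stored instances: (feature vector, label).
train = [
    ((1, 1), "low"), ((1, 2), "low"), ((2, 1), "low"),
    ((8, 8), "high"), ((9, 8), "high"), ((8, 9), "high"),
]

print(knn_predict(train, (2, 2)))
print(knn_predict(train, (7, 9)))
```

No explicit model is fitted: the stored instances themselves act as the memory from which predictions are made.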

Regression analysis based algorithms (RABA).

Regression is a process of refining the model iteratively based on the error generated by the 
model. Regression also is used to define a ML problem type. 
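
The iterative refinement can be sketched with gradient descent on the mean squared error of a straight-line model. The learning rate, epoch count and data are hypothetical choices for illustration; closed-form least squares would give the same fit for this problem.

```python
def fit_line(xs, ys, lr=0.05, epochs=2000):
    """Fit y = w * x + b by repeatedly correcting w and b using the current errors."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        errors = [(w * x + b) - y for x, y in zip(xs, ys)]
        # Gradient of the mean squared error with respect to w and b.
        w -= lr * 2.0 / n * sum(e * x for e, x in zip(errors, xs))
        b -= lr * 2.0 / n * sum(errors)
    return w, b

# Hypothetical data generated by y = 2x + 1.
w, b = fit_line([0, 1, 2, 3, 4], [1, 3, 5, 7, 9])
print(round(w, 3), round(b, 3))
```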

Association rule based learning algorithms (ARBLA).

Given the variables, association rule based learning algorithms extract and define rules that can
be applied to a dataset and demonstrate experience-based learning, and thus prediction. These
rules, when associated in a multi-dimensional data context, can be useful in a commercial context
as well.

Fuzzy Logic (FL).

FL was first developed by Zadeh [6] in the mid-1960s for representing uncertain and vague
knowledge. It provides an approximate but effective means of describing the behavior of systems
that are too complex, ill-defined, or not easily analyzed precisely. Fuzzy variables are handled
using a system called a fuzzy logic controller. It includes fuzzification, fuzzy inference, and
defuzzification. Zadeh argues that attempts to automate various types of activities, from
assembling hardware to medical diagnosis, have been impeded by the gap between the way
human beings reason and the way computers are programmed. FL uses graded statements
rather than ones that are strictly true or false.
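
The three stages of a fuzzy logic controller can be sketched as follows. This is a simplified illustration under stated assumptions: triangular membership functions, rules with fixed (singleton) outputs, and weighted-average defuzzification. The temperature ranges and fan speeds are hypothetical.

```python
def triangle(x, left, peak, right):
    """Triangular membership function: graded truth value between 0 and 1."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)

def fan_speed(temperature):
    # 1) Fuzzification: map the crisp input to graded memberships.
    cold = triangle(temperature, -10.0, 0.0, 20.0)
    warm = triangle(temperature, 10.0, 20.0, 30.0)
    hot = triangle(temperature, 20.0, 40.0, 50.0)
    # 2) Fuzzy inference: each rule pairs a firing strength with an output level
    #    (hypothetical rules: cold -> 0 %, warm -> 50 %, hot -> 100 %).
    rules = [(cold, 0.0), (warm, 50.0), (hot, 100.0)]
    total = sum(strength for strength, _ in rules)
    if total == 0.0:
        return 0.0
    # 3) Defuzzification: weighted average of the rule outputs.
    return sum(strength * output for strength, output in rules) / total

print(fan_speed(20.0))
print(round(fan_speed(25.0), 2))
```

At 25 degrees the input is partly "warm" and partly "hot", so the output lies between the two rule outputs instead of jumping from one to the other.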

Exact Learning (EL).

In this setting the goal is to learn a target concept from the ability to interact with it. For
concreteness, we focus on learning target concepts that are Boolean functions: the target is some
unknown $c : \{0,1\}^n \to \{0,1\}$ coming from a known concept class $\mathcal{C}$ of functions [7], and our goal is
to identify c exactly, with high probability, using membership queries (which allow the learner to
learn c(x) for x of his choice).

PAC (Probably Approximately Correct) Learning.

In this setting one also wants to learn an unknown $c : \{0,1\}^n \to \{0,1\}$ from a known concept class
$\mathcal{C}$, but in a more passive way than with membership queries: the learner receives several labeled
examples $(x, c(x))$, where x is distributed according to some unknown probability distribution D
over $\{0,1\}^n$. The learner gets multiple i.i.d. labeled examples. From this limited "view" on c, the
learner wants to generalize, producing a hypothesis h that probably agrees with c on "most" x,
measured according to the same D. This is the classical PAC model.

Agnostic Learning (AL).

In this setting one wants to approximate a distribution on $\{0,1\}^{n+1}$ by finding a good hypothesis
h to predict the last bit from the first n bits. A "good" hypothesis is one that is not much worse
than the best predictor available in a given class $\mathcal{C}$ of available hypotheses. The agnostic model
has more freedom than the PAC model and allows more realistic situations to be modeled, for example
when the data is noisy or when no "perfect" target concept exists. As in the PAC model, it turns
out that quantum sample complexity is not significantly smaller than classical sample complexity in
the agnostic model [7].

Metric Learning Method (MLM).

This approach involves learning a metric space in which learning is particularly efficient. This
approach has mostly been used for few-shot classification. Intuitively, if our goal is to learn from
a small number of example images, then a simple approach is to compare the image that we are
trying to classify with the example images that we have. But, as we might imagine, comparing
images in pixel space won't work well. Instead, we can train a Siamese network or perform
comparisons in a learned metric space.

Classical Machine Learning (CML).

CML and data analysis can be divided into several categories. First, computers can be used to
perform 'classic' data analysis methods such as least squares regression, polynomial
interpolation, and data analysis. Machine learning protocols can be supervised or unsupervised.

Quantum Machine Learning (QML).

According to Cristian Romero Garcia [8], QML aims to implement machine learning algorithms in
quantum systems, using quantum properties such as superposition and entanglement to
solve these problems efficiently. The field of classical machine learning is receiving a lot of
attention and investment from industry. Nowadays, due to the huge quantities of data with
which we deal every day, new approaches are needed to automatically manage, organize and
classify these data. Classical machine learning, which is a flexible and adaptable procedure, can
recognize patterns efficiently, but some of these problems cannot be efficiently solved by
these algorithms. The companies whose work consists of managing large databases are aware
of these limitations, and are very interested in new approaches to accomplish this. They have
found in quantum machine learning one of these approaches. Nevertheless, there is no general
theory to analyze and engineer new QML algorithms, and there are additionally some
unanswered related questions.

Adversarial Machine Learning (AML).

Systems using ML have been successfully deployed for fighting spam, fraud, and other malicious 
activities. These systems typically consist of a classifier that flags certain instances as malicious 
based on a fixed set of features. For example, spam filters classify each incoming email message 
as spam or legitimate email by using a set of features such as which words are present. 
Unfortunately, as classifiers become more widely deployed, the incentive for defeating them 
increases. 

Temporal Difference Learning (TDL).

TDL is an approach to learning how to predict a quantity that depends on future values of a given
signal. The name TDL derives from its use of changes, or differences, in predictions over successive
time steps to drive the learning process.
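
The role of differences between successive predictions can be sketched with the tabular TD(0) update. The three-state chain, reward and step size below are hypothetical, and TD(0) is only the simplest member of the temporal-difference family.

```python
def td0(episodes, n_states, alpha=0.1, gamma=1.0):
    """Tabular TD(0): move each prediction toward reward + next prediction."""
    values = [0.0] * (n_states + 1)  # the extra last entry is the terminal state (value 0)
    for episode in episodes:
        for state, reward, next_state in episode:
            # Temporal difference between successive predictions.
            td_error = reward + gamma * values[next_state] - values[state]
            values[state] += alpha * td_error
    return values[:n_states]

# Hypothetical chain 0 -> 1 -> 2 -> terminal (index 3), reward 1 on the last step.
episode = [(0, 0.0, 1), (1, 0.0, 2), (2, 1.0, 3)]
values = td0([episode] * 500, n_states=3)
print([round(v, 3) for v in values])
```

The prediction for the last state is corrected first, and the correction then propagates backwards to earlier states over repeated episodes.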

Multi-Task Learning (MTL).

Multi-task learning is an approach used to aggregate together similar tasks or problems and train 
a computer system to learn how to resolve collectively the tasks or problems. This leads to the 
resolution of more complex tasks or problems within the shortest possible time using machine 
learning. 

Greedy Learning Algorithms (GLA).

Greedy algorithms are a common algorithmic technique in computer science.
The idea of a greedy algorithm is to make each choice in a locally optimal manner.
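
A classic illustration, chosen here as an assumption rather than taken from the source, is making change by always taking the largest coin that still fits. The second call shows that a locally optimal choice does not always give a globally optimal answer.

```python
def greedy_change(amount, coins):
    """Make change by always taking the largest coin that still fits."""
    result = []
    for coin in sorted(coins, reverse=True):
        while amount >= coin:
            amount -= coin       # locally optimal choice: biggest coin first
            result.append(coin)
    return result

print(greedy_change(63, [25, 10, 5, 1]))  # optimal for this coin system
print(greedy_change(6, [4, 3, 1]))        # greedy uses 3 coins; 3 + 3 would need only 2
```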

Cloud Machine Learning (CML).

The commercial clouds are in a race to see who can provide the most interesting and useful AI
services on their cloud platform. The big breakthroughs came when deep learning models
trained on massive data collections began to reach levels of human accuracy.

Meta-learning System (MLS).

MLS are trained by being exposed to a large number of tasks and are then tested on their ability
to learn new tasks; an example of a task might be classifying a new image within 5 possible
classes, given one example of each class, or learning to efficiently navigate a new maze with only
one traversal through the maze. During MLS, the model is trained to learn tasks in the
meta-training set. There are two optimizations at play: the learner, which learns new tasks, and
the meta-learner, which trains the learner. Methods for meta-learning have typically fallen
into one of three categories: recurrent models, metric learning, and learning optimizers [9].

Q-Learning (QL). 

QL [10] is widely used in Reinforcement Learning (RL) and has had many positive results in
playing stochastic games, elevator control and robotics. Q-Learning produces an action-value
function or Q function, $Q : S \times A \to \mathbb{R}$, which maps every state-action pair to the expected utility of
taking the action in the state and following the greedy policy thereafter. The greedy policy of a Q
function is one that simply chooses the action with the highest Q-value for any state. QL is an
off-policy learning algorithm, meaning it can learn from example trajectories that differ from its
policy. To learn from demonstrations, we take advantage of this ability and simply perform QL 
along the demonstrated trajectory. 
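
Learning along a demonstrated trajectory can be sketched with the tabular Q-learning update. The three-state chain, the reward of 1 at the end, and the values of `alpha` and `gamma` are hypothetical; the demonstration is replayed several times so that values propagate back along the chain.

```python
def q_learning(trajectories, n_states, n_actions, alpha=0.5, gamma=0.9):
    """Tabular Q-learning applied to recorded (state, action, reward, next_state, done) steps."""
    q = [[0.0] * n_actions for _ in range(n_states)]
    for trajectory in trajectories:
        for state, action, reward, next_state, done in trajectory:
            # Off-policy target: bootstrap from the best action in the next state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
    return q

def greedy_policy(q):
    # For every state, choose the action with the highest Q-value.
    return [max(range(len(row)), key=row.__getitem__) for row in q]

# Hypothetical demonstration on a 3-state chain: action 1 ("right") reaches the goal.
demo = [(0, 1, 0.0, 1, False), (1, 1, 0.0, 2, False), (2, 1, 1.0, 2, True)]
q = q_learning([demo] * 50, n_states=3, n_actions=2)
print(greedy_policy(q))
```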

Fog Computing (FC). 

According to M.S. Mahdavinejad et al. [11], the architecture of fog computing is applied to migrate
information from the data centers to the edge of the servers. This architecture is built
on the edge servers. It has been and is being implemented in vital areas like
eHealth and military applications.

Edge Computing (EC). 

According to M.S. Mahdavinejad et al. [11], in this architecture, processing is run at a distance
from the core, toward the edge of the network. This type of processing enables data to be initially 
processed at the edge devices. Devices at the edge may not be connected to the network in a 
continuous manner, so they need a copy of master data/reference data for online processing. 
Edge devices have different features such as 1) enhancing security, 2) filtering and cleaning of 
the data, and 3) storing local data for local use. 

Socially Guided Machine Learning (SGML).

Agents (e.g., robotic helpers, non-player characters in games, etc.) that operate in human
environments will be more flexible if they are able to learn new tasks and skills from their
end-users. We recognize that while the average person is not familiar with machine learning
techniques, they are intimately familiar with social learning (e.g., tutelage, imitation). SGML [12]
explores ways in which machine learning can be designed to more fully take advantage of natural 
human interaction and tutelage. It asks questions like "how do people want to teach agents?" and 
"how do we design agents to learn effectively from natural human interaction and instruction?" 
Inspiration for SGML comes (in part) from Situated Learning Theory, a field of study that looks at 
the social world of children and how it contributes to their development. A key concept is 
scaffolding, where a teacher provides support such that a learner can achieve something they 
would not be able to accomplish independently. 

Robot Machine Learning (ROBOTML).

Considering the structure of a typical robot system, there are numerous opportunities to learn. 
They include learning of direct sensor-actor couplings on the lowest level of robot control as well 
as the acquisition of knowledge about the robot's environment or the continuous improvement 
of the robot's reasoning and planning capabilities, in order to enable it to handle increasingly 
complex tasks over time. 

Hebbian Learning (HL).

Donald Hebb is credited with being among the first to think about this problem and attempt to
come up with a plausible solution in his 1949 book The Organization of Behavior. However, Hebb
was primarily a psychologist and his ideas were stated in rather vague terms, such as: "When an
axon of cell A is near enough to excite cell B and repeatedly or persistently takes part in firing it,
some growth process or metabolic change takes place in one or both cells such that A's efficiency,
as one of the cells firing B, is increased", often paraphrased as "Neurons that fire together wire
together". Not a single equation can be found in his book. The concept of HL has nevertheless played an
important role in the development of both neuroscience and ML [13].
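
Since Hebb gave no equation, the sketch below uses a common later formalization, stated here as an assumption: the weight change is proportional to the product of pre-synaptic activity `x` and post-synaptic activity `y`. The learning rate and activity values are hypothetical.

```python
def hebbian_update(weights, x, y, lr=0.1):
    """Strengthen each weight in proportion to joint pre- and post-synaptic activity."""
    return [w + lr * xi * y for w, xi in zip(weights, x)]

weights = [0.0, 0.0, 0.0]
for _ in range(3):
    # Inputs 0 and 2 are active together with the output; input 1 is silent.
    weights = hebbian_update(weights, x=[1.0, 0.0, 1.0], y=1.0)
print([round(w, 2) for w in weights])
```

Only the connections from inputs that were active together with the output grow, which is the "fire together, wire together" behavior.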

Competitive 

Learning(CL). 

In CL, as the name implies, the output neurons of a neural network compete among themselves 
for being the one to be active (fired). Thus, whereas in a neural network based on Hebbian 
learning several output neurons may be active simultaneously, in the case of CL a single output
neuron is active at any one time. It is this feature that makes CL highly suited to discovering those
statistically salient features that may be used to classify a set of input configurations [14].
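The winner-take-all behavior described above can be sketched as follows. This is a minimal illustration under stated assumptions: the winner is chosen as the neuron whose weight vector is closest to the input in Euclidean distance, and only that neuron is updated. The names `winner` and `competitive_step`, the learning rate, and the toy data are hypothetical.

```python
def winner(weights, x):
    """Index of the output neuron whose weight vector is closest to x."""
    dists = [sum((wi - xi) ** 2 for wi, xi in zip(w, x)) for w in weights]
    return dists.index(min(dists))


def competitive_step(weights, x, eta=0.5):
    """Move only the winning neuron's weights toward x; losers stay unchanged."""
    k = winner(weights, x)
    weights[k] = [wi + eta * (xi - wi) for wi, xi in zip(weights[k], x)]
    return k


# Two output neurons and two loose clusters of inputs.
weights = [[0.0, 0.0], [1.0, 1.0]]
data = [[0.1, 0.2], [0.9, 0.8], [0.2, 0.1], [0.8, 0.9]]

for _ in range(10):
    for x in data:
        competitive_step(weights, x)

print(weights)  # each neuron has settled near one cluster
```

After training, each input activates exactly one neuron, so the index returned by `winner` can serve as a cluster label.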

The Boltzmann Learning (BL).

The BL rule, named in honor of L. Boltzmann, is a stochastic learning algorithm derived from 
information-theoretic and thermodynamic considerations. Here we merely emphasize the 
essence of what BL is. In a Boltzmann machine, the neurons constitute a recurrent structure, and 
they operate in a binary manner in that they are either in an "on" state denoted by +1 or in an
"off" state denoted by -1. The machine is characterized by an energy function E, the value of
which is determined by the particular states occupied by the individual neurons of the 
machine[14]. 
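The text does not spell out the energy function, so the sketch below assumes the form commonly given in textbooks, E = -1/2 * sum over j != k of w_kj * x_k * x_j, with symmetric weights and no self-connections. The function name and the two-neuron example are illustrative only.

```python
def boltzmann_energy(weights, states):
    """Energy of a state vector with entries in {+1, -1}.

    Assumed form: E = -1/2 * sum_{j != k} w[k][j] * x[k] * x[j].
    """
    n = len(states)
    total = 0.0
    for j in range(n):
        for k in range(n):
            if j != k:  # no self-feedback
                total += weights[k][j] * states[k] * states[j]
    return -0.5 * total


# Two neurons joined by a positive (excitatory) symmetric weight.
weights = [[0.0, 1.0],
           [1.0, 0.0]]

aligned = boltzmann_energy(weights, [+1, +1])
opposed = boltzmann_energy(weights, [+1, -1])
print(aligned, opposed)
```

With a positive weight, the state in which both neurons agree has the lower energy, which is the kind of configuration a stochastic update rule would tend to favor.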
Error-Correction Learning (ECL).

Let $T_k(n)$ denote some desired response or target response for neuron k at time n. The response
$S_k(n)$ is produced by a stimulus (vector) $x(n)$ applied to the input of the network in which
neuron k is embedded. The input vector $x(n)$ and desired response $T_k(n)$ for neuron k constitute
a particular example presented to the network at time n. It is assumed that this example and all
other examples presented to the network are generated by an environment that is probabilistic
in nature, but the underlying probability distribution is unknown. Typically, the actual response
$S_k(n)$ of neuron k is different from the desired response $T_k(n)$. Hence, we may define an error
signal as the difference between the target response $T_k(n)$ and the actual response $S_k(n)$, as
shown by

$$e_k(n) = T_k(n) - S_k(n).$$

The ultimate purpose of ECL is to minimize a cost function based on the error signal $e_k(n)$, such
that the actual response of each output neuron in the network approaches the target response for
that neuron in some statistical sense [14].
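One widely used instance of error-correction learning is the delta (Widrow-Hoff) rule for a linear neuron, sketched below. The text does not name a specific update rule, so this choice, the linear response, the learning rate, and the toy examples are all assumptions made for illustration.

```python
def response(weights, x):
    """Actual response of a linear neuron: weighted sum of the inputs."""
    return sum(w * xi for w, xi in zip(weights, x))


def error_correction_step(weights, x, target, eta=0.1):
    """Delta-rule update: error = target - response; w_i <- w_i + eta * error * x_i."""
    error = target - response(weights, x)
    new_weights = [w + eta * error * xi for w, xi in zip(weights, x)]
    return new_weights, error


weights = [0.0, 0.0]
# Each example pairs an input vector with its target response.
examples = [([1.0, 0.0], 2.0), ([0.0, 1.0], -1.0)]

for _ in range(200):
    for x, target in examples:
        weights, _ = error_correction_step(weights, x, target)

print(weights)  # the responses now closely match both targets
```

Each step shrinks the squared error on the presented example, so repeated presentation drives the actual response toward the target.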

Deep Learning (DL).

DL is an area of machine learning that focuses on unifying machine learning with artificial
intelligence. In relation to artificial neural networks, the field is an advancement of such
networks that works on large amounts of common data to derive practical insights. It deals with
building more complex neural networks to solve problems classified under semi-supervised
learning and operates on datasets that have little labeled data.

Convolutional Neural Networks (CNN).

CNNs are deep artificial neural networks used mainly to classify images, cluster them by
similarity, and perform object recognition. Related algorithms are used to identify faces, street
signs, tumours, platypuses and other aspects of visual data [15].
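The core operation of a CNN layer, sliding a small kernel over an image, can be illustrated with a minimal sketch. The function `conv2d_valid`, the toy image, and the hand-picked edge kernel are assumptions for illustration; in a real CNN the kernel values are learned and the operation is provided by a deep learning library.

```python
def conv2d_valid(image, kernel):
    """Slide the kernel over the image (no padding, stride 1).

    As in most deep learning libraries, the kernel is not flipped,
    so this is strictly a cross-correlation.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


# A 3x4 image with a dark left half and a bright right half.
image = [[0, 0, 1, 1],
         [0, 0, 1, 1],
         [0, 0, 1, 1]]

# A 1x2 kernel that responds to a dark-to-bright transition.
kernel = [[-1, 1]]

feature_map = conv2d_valid(image, kernel)
print(feature_map)
```

The resulting feature map is non-zero only at the column where the intensity changes, which is how convolutional filters pick out local visual features such as edges.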

Recurrent Neural Network (RNN).

An RNN is an advanced artificial neural network that builds on earlier types of networks with
fixed-size input and output vectors [16]. RNNs are related to deep learning and aim to simulate
neural activity in the human brain. They are widely used for sentiment classification, image
captioning and language translation [17].
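A minimal sketch of the recurrence can make the contrast with fixed-size networks concrete. It assumes a single hidden unit with the common update h_t = tanh(w_x * x_t + w_h * h_{t-1} + b); the weights below are arbitrary illustrative values, not trained ones.

```python
import math


def rnn_final_state(sequence, w_x=0.5, w_h=0.8, b=0.0):
    """Run a one-unit recurrent cell over a sequence and return its last state.

    The same weights are reused at every time step, so the sequence
    may have any length.
    """
    h = 0.0  # initial hidden state
    for x in sequence:
        h = math.tanh(w_x * x + w_h * h + b)
    return h


short = rnn_final_state([1.0, -1.0])
longer = rnn_final_state([1.0, -1.0, 0.5, 0.5, 2.0])
reordered = rnn_final_state([-1.0, 1.0])

print(short, longer, reordered)
```

Because the hidden state carries information from earlier steps, the same values presented in a different order lead to a different final state, which is what makes the architecture suitable for sequential data such as text.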


Table 1: Machine Learning and Biomedicine Sub-Fields Details


IV. Research applications in healthcare


S. No. 

Area 

Details 

1.

Evidence Based Medicine (EBM)

It is an approach taken by medical consultants to diagnose and
treat illnesses using medical evidence. It involves the
collection, interpretation, and summarization of evidence,
followed by systematic retrieval of the best evidence available
and its application in practice.

2. 

Adverse Drug Events (ADE) 

It is harm experienced by a patient resulting from the use of
a drug. It may occur due to medication errors such as overdose,
wrong medication, adverse drug reactions or wrong patient.
Identification of factors relating to adverse drug events is one of 
the major areas of research in medicine. 

3. 

Healthcare Management (HM) 

It is health management using monitoring equipment and 
devices whose objective is to help patients manage their medical 
conditions at home. Systems are designed which help to identify 
chronic disease state and keep track of high risk patients, and 
design appropriate interventions and reduce number of hospital 
admissions. 

4. 

Predictive Data Mining (PDM) in clinical
medicine

Uses patient-specific information to derive models that can
predict the outcome of interest and thus support clinical
decision making.

5. 

Medical Decision Support System (MDSS) 

It is a framework which enables Medical decision making in the 
presence of partial information. A medical decision support 
system is a health information technology system designed to 
provide doctors and health professionals with assistance during 
decision making. 

6. 

Passive In-Home Health and Wellness 
Monitoring 

These systems are used for monitoring older adults passively in 
their own living settings through placing sensors in their living 
environment. Their routine activities are analyzed and mined to 
detect changes in their health conditions or indicators of early 
onset of disease. 

7. 

SMARTDIAB: A Communication and
Information Technology (IT) Approach for
the Intelligent Monitoring, Management
and Follow-Up of Type 1 Diabetes Patients

An automated system that supports the monitoring,
management and treatment of patients with type 1 diabetes.

8. 

HCloud: An Innovative Application Oriented
Cloud Platform for Precautionary
Healthcare

HCloud is a healthcare system developed for preventive
healthcare services. It is used by many healthcare workers to
access and consolidate all patient electronic medical records 
which gives fast access to patient

9.

Effective Management of Hospital Resources

Using data mining, it is possible to detect chronic disease
and, based on the complications of each patient's disease, to
prioritize patients so that they receive effective treatment in a
timely and accurate manner. Fitness reports and demographic
details of patients are also useful for utilizing the available
hospital resources effectively.

10. 

Hospital Ranking (HR)

Different data mining approaches are used to analyze the 
various hospital details in order to determine their ranks. 

11. 

Better Customer Relations (BCR)

Data mining helps healthcare institutes to understand the
needs, preferences, behavior, patterns and quality of their
customers in order to build better relations with them. Using data
mining, Customer Potential Management Corp. developed an
index that represents the utilization of consumer healthcare. This
index helps to detect the inclination of customers towards a
particular healthcare service.

12. 

Hospital Infection Control (HIC)

A system for inspection is constructed using data mining
techniques to discover unknown or irregular patterns in the
infection control data. To control infection in hospitals,
this information is reviewed further by an expert.

13. 

Smarter Treatment Techniques (STT)

Using data mining, physicians and patients can easily compare
different treatment techniques.

14. 

Improved Patient Care (IPC)

Large amounts of data are collected with the advancement of
electronic health records. Patient data available in
digitized form improves the quality of the healthcare system. In
order to analyze this massive data, a predictive model is
constructed using data mining that discovers interesting
information in the data and supports decisions on improving
healthcare quality.

15. 

Decrease Insurance Fraud (DIF)

Healthcare insurers develop models to detect fraud and
abuse in medical claims using data mining techniques. These
models check for improper medication and false data in claims.

16. 

Recognize High-Risk Patients (RHRP)

The main concern of this system is to manage diabetic
patients, improve their quality of health and offer cost-saving
services to the patient. Using a predictive model,
healthcare providers recognize the patients who require more
attention than other patients.

17. 

Health Policy Planning (HPP)

Data mining plays an important role in making effective
healthcare policy in order to improve health quality as well as
reduce the cost of health services.

18. 

Understanding Medical Data (UMD)

With Internet of Things (IoT) devices such as the Fitbit on the
rise, the number of ways in which pharmaceutical research can
collect vast amounts of medical data from anonymous sources is
increasing.

19. 

Robotic Surgery (RS)

Robotic surgery is nothing new, but recent advancements are
being progressively adopted in the field of pharmaceutical research (PR).

20. 

Automatic 

ML systems within robotic surgeons are one way in which 
automation could find itself integrated into pharmaceutical 
Research of the future. 

21. 

Radioactivity Treatments

The ability of robotic surgeons to distinguish and compare healthy
and cancerous tissues and cells is key to building the most
appropriate treatment plans, whereas treatment that relies on
human involvement may be very risky because of radiation.

22. 

Generating Electronic Smart Card (GESC)

With the enormous volumes of medical and healthcare data
now available, the implementation of smart electronic
healthcare records has become essential. This, however, creates
even more challenges for those already working within the
industry. Machine learning projects make up a
significant percentage of the efforts going into assisting in the
creation of such smart records, using handwriting recognition
and vision technologies. Other machine
learning applications in the creation of electronic smart records
involve using records with built-in artificial intelligence so as to
assist with keeping medical records, interpreting health
conditions and suggesting treatment plans.

23.

Optimizing Clinical Trials (OCT) 

There have been several calls for machine learning technologies 
to be more closely involved in clinical research trials. 

24. 

Personalized Medicine (PM)

PM and personalized treatments have been discussed and debated
for many years now; however, with technological advances in
healthcare devices, artificial intelligence, machine learning and
centralized systems, such treatments could soon be available. As previously
mentioned, machine learning technologies can be used to 
interpret the high volumes of patient data collected by IoT and 
healthcare devices and then use these interpretations to predict 
conditions or suggest treatments. With both the widespread 
adoption of AI and machine learning and the increasing 
prevalence of data-gathering medical devices, such solutions are 
currently under investigation by industry players such as IBM 
and could soon be commonplace within healthcare institutions 
and facilities. 

25. 

Future Prediction (FP) 

Future healthcare outcomes can be predicted with the help of ML
and DL, which is beneficial for humanity.


Table 2: Research Application In Healthcare 


V. Medical Application last 03 years


Columns: References, Techniques, Modality, Area, Year, Database, Method, Result

Reference: [18]. Technique: CNN. Modality: H&E. Area: N/A. Year: 2014. Method: Annotation.

This paper used a multiple instance learning (MIL) framework in
classification training with deep learning features. Several
interesting conclusions can be drawn: automatic feature
learning outperforms manual features; the unsupervised approach
can achieve performance close to that of the fully supervised approach
(93.56% vs. 94.52%); and the MIL performance with coarse labels (96.30%)
outweighs the supervised performance with fine labels (95.40%) in
supervised deep learning features.

Reference: [19]. Technique: M-CNN. Modality: H&E. Area: Breast. Year: 2016. Database: AMIDA.

The AggNet model almost performs around 8.6% of AUC as good as 
the augmented ground truth model AM-GT and easily outperforms 
both AM-MV and AM-GLAD around 7.6% of AUC. 

Reference: [20]. Technique: FCN. Modality: H&E. Area: N/A. Year: 2017.

Extensive experiments using the 2015 MICCAI Gland Challenge
dataset and a lymph node ultrasound image segmentation dataset
show that, using annotation suggestions by our method,
state-of-the-art segmentation performance can be achieved by using
only 50% of the training data.

Reference: [21]. Technique: M-CNN. Modality: CT. Area: Lung. Year: 2015. Database: LIDC-IDRI. Method: Classification.

Experimental results demonstrate the effectiveness of the M-CNN
method in classifying malignant and benign nodules without nodule
segmentation.

Reference: [22]. Technique: 3D-CNN. Modality: MRI. Area: Brain. Year: 2015. Database: ADNI.

The authors demonstrate that 3D convolutional neural networks outperform
several other classifiers reported in the literature and produce
state-of-the-art results.

Reference: [23]. Technique: CNN + RNN. Modality: RGB. Area: Eye. Year: 2015.

The proposed system is validated on a large population-based
dataset of over 5378 images, where it outperforms the state of the art,
achieving, with respect to clinical grading, a mean absolute error
(ε) of 0.304, a 70.7% exact integral agreement ratio (R0), an 88.4%
decimal grading error <0.5 (Re0.5), and a 99.0% decimal grading
error <1.0 (Re1.0).

Reference: [24]. Technique: CNN. Modality: X-ray. Area: Knee. Year: 2016. Database: OAI.

Results on a dataset of X-ray images and KL grades from the
Osteoarthritis Initiative (OAI) show a sizable improvement over the
current state-of-the-art.

Reference: [25]. Technique: CNN. Modality: H&E. Area: Thyroid. Year: 2016.

This paper is based on machine learning applied to the thyroid.

Reference: [26]. Technique: 3D-CNN, 3D-CAE. Modality: MRI. Area: Brain. Year: 2016. Database: ADNI.

Experiments on the ADNI MRI dataset with no skull-stripping
preprocessing have shown that the 3D-CNN outperforms several
conventional classifiers in accuracy and robustness. The ability of the
3D-CNN to generalize the features learnt and adapt to other domains
has been validated on the CADDementia dataset.

Reference: [27]. Technique: M-CNN. Modality: RGB. Area: Skin. Year: 2016. Database: Dermofit.

The entire network is fine-tuned in a fully learned end-to-end
optimization with auxiliary loss functions. The paper shows how the
proposed novel multi-tract network yields higher classification
accuracy, outperforming state-of-the-art multi-scale approaches
when compared over a public skin lesion dataset.

Reference: [28]. Technique: CNN. Modality: RGB. Area: Skin, Eye. Year: 2016. Database: EDRA, DRD.

This report investigates how different transfer learning schemes
influence automated melanoma classification results. The paper
evaluates transfer learning from a general-context dataset
(ImageNet) and from a specific-domain dataset (diabetic
retinopathy). The authors investigate whether sequential transfer steps
improve the final classification result. The paper shows results
consistent with the literature regarding training a DNN from scratch
and the differences between fine-tuning or not. The authors conclude
that general findings of the deep learning state of the art are also
applicable to automated melanoma screening, thus
guiding future research.

Reference: [29]. Technique: M-CNN. Modality: CT. Area: Lung. Year: 2016. Database: LIDC-IDRI, ANODE09, DLCST.

The proposed architecture comprises multiple streams of 2-D
ConvNets, for which the outputs are combined using a dedicated
fusion method to get the final classification. Data augmentation and
dropout are applied to avoid overfitting. On 888 scans of the publicly
available LIDC-IDRI dataset, our method reaches high detection
sensitivities of 85.4% and 90.1% at 1 and 4 false positives per scan,
respectively. An additional evaluation on independent datasets from
the ANODE09 challenge and DLCST is performed. The paper shows
that the proposed multi-view ConvNets are highly suited to
false positive reduction in a CAD system.

Reference: [30]. Technique: 3D-CNN. Modality: CT. Area: Lung. Year: 2018. Database: LIDC-IDRI, LUNA16.

Extensive experimental results demonstrate that DeepLung
has performance comparable to experienced doctors both for
the nodule-level and patient-level diagnosis on the LIDC-IDRI
dataset.

Reference: [31]. Technique: 3D-CNN. Modality: MRI. Area: Brain. Year: 2016.

Experimental results demonstrate that the methods can achieve an
accuracy as high as 89.9%. The paper also finds that the learned
features from fMRI and DTI play more important roles in accurately
predicting the OS time, which provides valuable insights into
functional neuro-oncological applications.

Reference: [32]. Technique: SDAE. Modality: US, CT. Area: Breast, Lung. Year: 2016. Database: LIDC-IDRI.


The experimental results show the significant performance boost by 
the SDAE-based CADx algorithm over the two conventional methods, 
suggesting that deep learning techniques can potentially change the 
design paradigm of the CADx systems without the need of explicit 
design and selection of problem-oriented features. 

Reference: [33]. Technique: CAE. Modality: MG. Area: Breast. Year: 2016.

The state-of-the-art results show that the learned breast density
scores have a very strong positive relationship with manual ones,
and that the learned texture scores are predictive of breast cancer.
The model is easy to apply and generalizes to many other
segmentation and scoring problems.

Reference: [34]. Technique: MIL-CNN. Modality: MG. Area: Breast. Year: 2017. Database: INbreast.

Experimental results on the INbreast dataset demonstrate the
robustness of the proposed networks compared to previous work using
segmentation and detection annotations.

Reference: [35]. Technique: GCN. Modality: MRI. Area: Brain. Year: arXiv. Database: ADNI, ABIDE.

This paper introduces the novel concept of Graph Convolutional
Networks (GCN) for brain analysis in populations, combining
imaging and non-imaging data. The paper represents populations as
a sparse graph whose vertices are associated with image-based
feature vectors and whose edges encode phenotypic information. This
structure was used to train a GCN model on partially labelled graphs,
aiming to infer the classes of unlabeled nodes from the node features
and pairwise associations between subjects. The paper demonstrates
the potential of the method on the challenging ADNI and ABIDE
databases, as a proof of concept of the benefit of integrating
contextual information in classification tasks. This has a clear impact
on the quality of the predictions, leading to 69.5% accuracy for
ABIDE (outperforming the current state of the art of 66.8%) and
77% for ADNI for prediction of MCI conversion, significantly
outperforming standard linear classifiers where only individual
features are considered.

Reference: [36]. Technique: CNN. Modality: RGB. Area: Skin. Year: 2017.

This paper demonstrates classification of skin lesions using a single
CNN, trained end-to-end from images directly, using only pixels
and disease labels as inputs. The authors train a CNN using a dataset of
129,450 clinical images (two orders of magnitude larger than
previous datasets) consisting of 2,032 different diseases. Its
performance is tested against 21 board-certified dermatologists on
biopsy-proven clinical images with two critical binary
classification use cases: keratinocyte carcinomas versus benign
seborrheic keratosis; and malignant melanomas versus benign
nevi. The first case represents the identification of the most
common cancers, the second represents the identification of the
deadliest skin cancer. The CNN achieves performance on par with
all tested experts across both tasks, demonstrating an artificial
intelligence capable of classifying skin cancer with a level of
competence comparable to dermatologists. Outfitted with deep
neural networks, mobile devices can potentially extend the reach
of dermatologists outside of the clinic.

Reference: [37]. Technique: FCN + CNN. Modality: MRI. Area: Liver-Liver Tumor. Year: 2017.

In this method the framework achieves an end-to-end accuracy of
65% with a Dice score for the automatic lesion segmentation of 69%
and an accuracy of 68% for tumor malignancy classification based on
expert annotations. The authors compared SurvivalNet to classical
handcrafted features such as Histogram and Haralick and show
experimentally that SurvivalNet outperforms the handcrafted
features in HCC malignancy classification. End-to-end assessment of
tumor malignancy based on the proposed fully automatic framework
corresponds to assessment based on expert annotations with high
significance (p>0.95).

Reference: [38]. Technique: MLP. Modality: CT. Area: Head-Neck. Year: 2015. Method: Detection / Localization.

The proposed method has been quantitatively evaluated for carotid
artery bifurcation detection on a head-neck CT dataset from 455
patients. Compared to the state-of-the-art, the mean error is reduced
by more than half, from 5.97 mm to 2.64 mm, with a detection speed
of less than 1 s/volume.

Reference: [39]. Technique: CNN. Modality: US. Area: Fetal. Year: 2015.

The authors propose a transfer learning strategy, which transfers the
knowledge in the low layers of a base CNN trained from a large
database of natural images to a task-specific CNN. Extensive
experiments demonstrate that the approach outperforms the
state-of-the-art method for FASP localization as well as the CNN
trained only on the limited US training samples. The proposed approach
can be easily extended to other similar medical image computing
problems, which often suffer from insufficient training samples
when exploiting a deep CNN to represent high-level features.

Reference: [40]. Technique: 2.5D-CNN. Modality: MRI. Area: Femur. Year: 2015. Database: OAI.

This paper used narrow-band graph cut optimization to achieve
3D segmentation of the femur surface. Finally, the anatomical
landmarks are located on the femur according to the geometric cues
of the surface mesh. Experiments demonstrate that the proposed
method is effective, efficient, and reliable in segmenting the femur and
locating the anatomical landmarks.

Reference: [41]. Technique: LSTM. Modality: US. Area: Fetal. Year: 2015.

This paper proposes a joint learning framework with knowledge
transfer across multiple tasks to address the issue of
limited training data. Extensive experiments on different US
standard planes with hundreds of videos corroborate that the method
can achieve promising results, which outperform state-of-the-art
methods.

Reference: [42]. Technique: CNN. Modality: X-ray, MRI. Area: Hand. Year: 2016. Database: DHADS.

This paper describes a machine learning method based on a CNN
applied to X-ray and MRI images.

Reference: [43]. Technique: CNN. Modality: MRI, US, CT. Year: 2016. Database: SATCOM.

This paper describes a new learning method following a
fundamentally different paradigm by simultaneously modeling both
the object appearance and the parameter search strategy as a unified
behavioral task for an artificial agent. The method combines the
advantages of behavior learning achieved through reinforcement
learning with effective hierarchical feature extraction achieved
through deep learning. The paper shows that given only a sequence of
annotated images, the agent can automatically and strategically learn
optimal paths that converge to the sought anatomical landmark
location as opposed to exhaustively scanning the entire solution
space. The method significantly outperforms state-of-the-art
machine learning and deep learning approaches both in terms of
accuracy and speed on 2D magnetic resonance images, 2D
ultrasound and 3D CT images, achieving average detection errors of
1-2 pixels, while also recognizing the absence of an object from the
image.

Reference: [44]. Technique: FCN. Modality: US. Area: Fetal. Year: 2016.

This paper considers a fully automated system based on
convolutional neural networks which can detect twelve standard
scan planes as defined by the UK fetal abnormality screening
programme. The network design allows real-time inference and can
be naturally extended to provide an approximate localization of the
fetal anatomy in the image. Such a framework can be used to
automate or assist with scan plane selection, or for the retrospective
retrieval of scan planes from recorded videos. The method is
evaluated on a large database of 1003 volunteer mid-pregnancy
scans. The paper shows that standard planes acquired in a clinical
scenario are robustly detected with a precision and recall of 69%
and 80%, which is superior to the current state-of-the-art.
Furthermore, the paper shows that the method can retrospectively
retrieve correct scan planes with an accuracy of 71% for cardiac
views and 81% for non-cardiac views.

Reference: [45]. Technique: CNN+LSTM. Modality: MRI. Area: Heart. Year: 2016.

This paper proposes a novel deep learning architecture, named the
temporal regression network (TempReg-Net), to accurately identify
specific frames from MRI sequences by integrating the
Convolutional Neural Network (CNN) with the Recurrent Neural
Network (RNN). Specifically, a CNN encodes the spatial information
of a cardiac sequence, and an RNN decodes the temporal information.
In addition, the authors design a new loss function in the network to
constrain the structure of predicted labels, which further improves
the performance. The approach is extensively validated on
thousands of cardiac sequences and the average difference is merely
0.4 frames, comparing favorably with previous systems.


Reference: [46]. Technique: M-CNN. Modality: MRI. Area: Heart. Year: 2016.

The methods are evaluated on three different data sets with different
numbers of patients: 59 patients for sclerotic metastases detection,
176 patients for lymph node detection, and 1,186 patients for colonic
polyp detection. Experimental results show the ability of ConvNets
to generalize well to different medical imaging CADe applications
and scale elegantly to various data sets. The proposed methods
improved CADe performance markedly in all
cases. CADe sensitivities improved from 57% to 70%, from 43% to
77% and from 58% to 75% at 3 FPs per patient for sclerotic
metastases, lymph nodes and colonic polyps, respectively.

Reference: [47]. Technique: CNN. Modality: PET/CT. Area: Heart. Year: 2016.

The overall scheme detects pulmonary nodules using both CT and
PET images. In the CT images, a massive region is first detected using
an active contour filter, which is a type of contrast enhancement
filter that has a deformable kernel shape. Subsequently, high-uptake
regions detected by the PET images are merged with the regions
detected by the CT images. FP candidates are eliminated using an
ensemble method; it consists of two feature extractions, one by
shape/metabolic feature analysis and the other by a CNN, followed
by a two-step classifier, one step being rule based and the other
being based on support vector machines.

The authors evaluated the detection performance using 104 PET/CT
images collected by a cancer-screening program. The sensitivity in
detecting candidates at an initial stage was 97.2%, with 72.8
FPs/case. After performing the proposed FP-reduction method, the
sensitivity of detection was 90.1%, with 4.9 FPs/case; the proposed
method eliminated approximately half the FPs existing in the
previous study.

Reference: [48]. Technique: 3D-CNN. Modality: MRI. Area: Brain. Year: 2016.

The authors propose a cascaded framework under 3D
CNNs for the task of CMB detection. They first exploit a 3D
fully convolutional network (FCN) strategy to retrieve the
candidates with high probabilities of being CMBs, and then apply a
well-trained 3D CNN discrimination model to distinguish CMBs from
hard mimics. Compared with the traditional sliding window strategy, the
proposed 3D FCN strategy can remove massive redundant
computations and dramatically speed up the detection process. The
authors constructed a large dataset with 320 volumetric MR scans
and performed extensive experiments to validate the proposed
method, which achieved a high sensitivity of 93.16% with an average
number of 2.74 false positives per subject, outperforming previous
methods using low-level descriptors or 2D CNNs by a significant
margin. The proposed method, in principle, can be adapted to other
biomarker detection tasks from volumetric medical data.


Reference: [49]. Technique: CNN. Modality: X-ray, MG. Year: 2016. Database: NIH, China, DDSM, MIAS.

In this work, the authors present a novel weakly supervised learning
framework for lesion localization named self-transfer learning
(STL). STL jointly optimizes both classification and localization
networks to help the localization network focus on correct lesions
without any types of priors. The authors evaluate the STL framework
over chest X-rays and mammograms, and achieve significantly better
localization performance compared to previous weakly supervised
localization approaches.

Reference: [50]. Technique: CNN. Modality: RGB. Area: Eye. Year: 2016. Database: DRD, MESSIDOR.

The authors propose a method to improve and speed up
CNN training for medical image analysis tasks by dynamically
selecting misclassified negative samples during training. Training
samples are heuristically sampled based on classification by the
current status of the CNN. Weights are assigned to the training
samples and informative samples are more likely to be included in
the next CNN training iteration. The authors evaluated the
proposed method by training a CNN with (SeS) and without
(NSeS) the selective sampling method. The paper focuses on the
detection of hemorrhages in color fundus images. A decrease in
training time from 170 epochs to 60 epochs, with an increased
performance on par with two human experts, was achieved with
areas under the receiver operating characteristics curve of 0.894 and
0.972 on two data sets. The SeS CNN statistically outperformed the
NSeS CNN on an independent test set.

Reference: [51]. Technique: GAN. Year: 2017.

The authors propose AnoGAN, a deep convolutional
generative adversarial network to learn a manifold of normal
anatomical variability, accompanying a novel anomaly scoring
scheme based on the mapping from image space to a latent space.
Applied to new data, the model labels anomalies, and scores image
patches indicating their fit into the learned distribution. Results on
optical coherence tomography images of the retina demonstrate that
the approach correctly identifies anomalous images, such as images
containing retinal fluid or hyperreflective foci.

Reference: [52]. Technique: FCN. Modality: X-ray. Area: Cardiac. Year: 2016.

The authors propose two novel methods for (i) fully automatic
electrophysiology catheter electrode detection in interventional
X-ray images and (ii) single-view depth estimation of such electrodes
based on convolutional neural networks. For (i), experiments on 24
different fluoroscopy sequences (1650 X-ray images) yielded a
detection rate > 99%. For (ii), experiments on depth prediction
using 20 images with depth information available revealed that
the depth of catheter tips in the lateral view can be estimated with
a remarkable mean error of 6.08±4.66 mm.

Reference: [53]. Technique: 3D-CNN. Modality: CT. Area: Lung. Year: 2018. Database: LIDC-IDRI, LUNA16.

The nodule classification subnetwork was validated on a public 
dataset from LIDC-IDRI, on which it achieved better performance 
than state-of-the-art approaches and surpassed the performance of 
experienced doctors based on image modality. Within the Deep Lung 
system, candidate nodules are detected first by the nodule detection 
subnetwork, and nodule diagnosis is conducted by the classification 
subnetwork. Extensive experimental results demonstrate that Deep 
Lung has performance comparable to experienced doctors both for 
the nodule-level and patient-level diagnosis on the LIDC-IDRI 
dataset. 

Reference: [54]. Technique: U-Net. Year: 2015. Method: Segmentation.

The authors show that such a network can be trained end-to-end 
from very few images and outperforms the prior best method (a 
sliding-window convolutional network) on the ISBI challenge for 
segmentation of neuronal structures in electron microscopic stacks. 
Using the same network trained on transmitted light microscopy 
images (phase contrast and DIC), the authors won the ISBI cell 
tracking challenge 2015 in these categories by a large margin. Moreover, the 
network is fast. Segmentation of a 512x512 image takes less than a 
second on a recent GPU. 

Area: Head-Neck. Year: 2016.


The authors present a pipeline that is extensively evaluated on 
three challenging lesion segmentation tasks in multi-channel MRI 
data of patients with traumatic brain injuries, brain tumors, and 
ischemic stroke. The authors improve on the state of the art for all 
three applications, with top-ranking performance on the public 
benchmarks BRATS 2015 and ISLES 2015. The method is 
computationally efficient, which allows its adoption in a variety of 
research and clinical settings. The source code of the 
implementation is made publicly available. 

Area: Liver-Liver Tumor. Year: 2016.



This paper presents a method to automatically segment liver and 
lesions in CT abdomen images using cascaded fully convolutional 
neural networks (CFCNs) and dense 3D conditional random fields 
(CRFs). The authors train and cascade two FCNs for a combined 
segmentation of the liver and its lesions. In the first step, an FCN 
is trained to segment the liver as ROI input for a second FCN. The 
second FCN solely segments lesions from the predicted liver ROIs of 
step 1. The segmentations of the CFCN are then refined using a dense 
3D CRF that accounts for both spatial coherence and appearance. CFCN 
models were trained in a 2-fold cross-validation on the abdominal CT 
dataset 3DIRCAD comprising 15 hepatic tumor volumes. The results 
show that CFCN-based semantic liver and lesion segmentation 
achieves Dice scores over 94% for liver with computation times 
below 100 s per volume. The paper experimentally demonstrates the 
robustness of the proposed method as a decision support system 
with high accuracy and speed for use in daily clinical routine. 

DL technique: 3D-CNN. Area: Spine. Year: 2016.

Model-Based Segmentation of Vertebral Bodies from MR Images 
with 3D CNNs 

Area: Liver-Liver Tumor. Year: 2017.

This paper presents a method to automatically segment liver and 
lesions in CT and MRI abdomen images using cascaded fully 
convolutional neural networks (CFCNs), enabling the segmentation 
of large-scale medical trials or quantitative image analyses. The 
authors train and cascade two FCNs for a combined segmentation of 
the liver and its lesions. In the first step, an FCN is trained to 
segment the liver as ROI input for a second FCN. The second FCN 
solely segments lesions within the predicted liver ROIs of step 1. 
CFCN models were trained on an abdominal CT dataset comprising 100 
hepatic tumor volumes. Validations on further datasets show that 
CFCN-based semantic liver and lesion segmentation achieves Dice 
scores over 94% for liver with computation times below 100 s per 
volume. The authors further experimentally demonstrate the 
robustness of the proposed method on 38 MRI liver tumor volumes and 
the public 3DIRCAD dataset. 

Area: Liver-Liver Tumor. Year: 2017.

A 3D neural network (SurvivalNet) then predicts the HCC lesions' 
malignancy from the HCC tumor segmentation. The authors formulate 
this task as a classification problem with the classes "low risk" and 
"high risk" represented by longer or shorter survival times than the 
median survival. The method was evaluated on DWI of 31 HCC 
patients. The proposed framework achieves an end-to-end accuracy 
of 65% with a Dice score for the automatic lesion segmentation of 
69% and an accuracy of 68% for tumor malignancy classification 
based on expert annotations. The authors compared SurvivalNet to 
classical handcrafted features such as histogram and Haralick 
features and show experimentally that SurvivalNet outperforms the 
handcrafted features in HCC malignancy classification. End-to-end 
assessment of tumor malignancy based on the proposed fully automatic 
framework corresponds to assessment based on expert annotations with 
high significance (p>0.95). 

DL technique: 3D-CNN. Imaging modality: Diffusion MRI. Area: Brain. Year: 2016.


The authors demonstrate how deep learning, a group of algorithms 
based on recent advances in the field of artificial neural networks, 
can be applied to reduce diffusion MRI data processing to a single 
optimized step. This modification allows obtaining scalar measures 
from advanced models at twelve-fold reduced scan time and 
detecting abnormalities without using diffusion models. The authors set a 
new state of the art by estimating diffusion kurtosis measures from 
only 12 data points and neurite orientation dispersion and density 
measures from only 8 data points. This allows unprecedentedly fast 
and robust protocols facilitating clinical routine and demonstrates 
how classical data processing can be streamlined by means of deep 
learning. 

DL technique: GAN. Area: Breast Mass. Year: 2018.


A multi-scale FCN is employed to improve the segmentation 
performance. Experimental results on two public datasets, INbreast 
and DDSM-BCRP, demonstrate that the proposed end-to-end network 
achieves better performance than state-of-the-art approaches. 

Dataset: INbreast, DDSM-BCRP.

DL technique: 3D-CNN. Area: Liver. Year: 2017.



The authors introduce a deep supervision mechanism during the 
learning process to combat potential optimization difficulties, so 
that the model acquires a much faster convergence rate and more 
powerful discrimination capability. On top of the high-quality score 
map produced by the 3D DSN, a conditional random field model is 
further employed to obtain refined segmentation results. The authors 
evaluated the framework on the public MICCAI-SLiver07 dataset. 
Extensive experiments demonstrated that the method achieves 
segmentation results competitive with state-of-the-art approaches 
at a much faster processing speed. 


DL technique: 3D-CNN. Area: Brain. Year: 2017.


The authors proposed a multi-connected domain discriminator for 
improved adversarial training. The system is evaluated using two MR 
databases of subjects with traumatic brain injuries, acquired using 
different scanners and imaging protocols. Using this unsupervised 
approach, the authors obtain segmentation accuracies that are close 
to the upper bound of supervised domain adaptation. 

DL technique: 3D-CNN. Area: Spine. Year: 2016.


The authors show how machine learning with a 3D-CNN can be applied 
to the spine. 

DL technique: 2.5D-CNN. Year: 2015.

Regression 

This paper proposes an automatic localization method to determine 
the positions of initial geometric landmarks on the femur surface 
in 3D MR images. Based on the results from the convolutional 
neural network (CNN) classifiers and shape statistics, the authors 
use narrow-band graph cut optimization to achieve the 3D 
segmentation of the femur surface. Finally, the anatomical landmarks 
are located on the femur according to the geometric cues of the 
surface mesh. Experiments demonstrate that the proposed method is 
effective, efficient, and reliable for segmenting the femur and 
locating the anatomical landmarks. 

DL technique: 3D-CNN. Imaging modality: Diffusion MRI. Area: Brain. Year: 2016. Dataset: HCP.

The paper demonstrates how DL, a group of algorithms based on 
recent advances in the field of artificial neural networks, can be 
applied to reduce diffusion MRI data processing to a single optimized 
step. This modification allows obtaining scalar measures from 
advanced models at twelve-fold reduced scan time and detecting 
abnormalities without using diffusion models. Here the authors set a 
new state of the art by estimating diffusion kurtosis measures from 
only 12 data points and neurite orientation dispersion and density 
measures from only 8 data points. This allows unprecedentedly fast 
and robust protocols facilitating clinical routine and demonstrates 
how classical data processing can be streamlined by means of deep 
learning. 

Imaging modality: CS-MRI. Year: 2017.

Image Reconstruction and 
Post Processing 

The authors show that the proposed method consistently 
outperforms state-of-the-art methods and is capable of preserving 
anatomical structure more faithfully up to 11-fold undersampling. 
Moreover, reconstruction is very fast: each complete dynamic 
sequence can be reconstructed in less than 10 s and, for the 2-D case, 
each image frame can be reconstructed in 23 ms, enabling real-time 
applications. 

Imaging modality: CS-MRI. Year: 2017.


The authors show that the proposed method consistently 
outperforms state-of-the-art methods. 


DL Techniques - NN: Neural Networks, MLP: Multilayer Perceptron, RBM: Restricted Boltzmann Machine, SAE: Stacked 
Auto-Encoders, CAE: Convolutional Auto-Encoders, CNN: Convolutional Neural Networks, RNN: Recurrent Neural 
Networks, LSTM: Long Short-Term Memory, M-CNN: Multi-Scale/View/Stream CNN, MIL-CNN: Multi-Instance 
Learning CNN, FCN: Fully Convolutional Networks 

Imaging Modality [IM] - US: Ultrasound, MR/MRI: Magnetic Resonance Imaging, PET: Positron Emission Tomography, 
MG: Mammography, CT: Computed Tomography, H&E: Hematoxylin & Eosin Histology Images, RGB: Optical Images 

TABLE 3: Medical applications over the last three years 
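
Several of the segmentation results summarized in Table 3 are reported as Dice scores. As a minimal sketch, not taken from any of the cited papers, the Dice score between a predicted and a reference binary mask could be computed as shown here. The function name `dice_score`, the flattened-list mask representation, and the toy masks are illustrative assumptions.

```python
def dice_score(pred, ref):
    """Dice = 2 * |A and B| / (|A| + |B|) for two flattened binary masks.

    `pred` and `ref` are hypothetical inputs: equal-length sequences of 0/1
    voxel labels (predicted segmentation and reference annotation).
    """
    if len(pred) != len(ref):
        raise ValueError("masks must have the same number of voxels")
    # Voxels labeled foreground in both masks.
    intersection = sum(1 for p, r in zip(pred, ref) if p and r)
    # Total foreground voxels across both masks.
    total = sum(1 for p in pred if p) + sum(1 for r in ref if r)
    if total == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2.0 * intersection / total


# Toy example: 3 overlapping voxels, 4 foreground voxels in each mask.
pred = [0, 1, 1, 1, 0, 0, 1, 0]
ref = [0, 1, 1, 0, 0, 0, 1, 1]
print(dice_score(pred, ref))  # 0.75
```

A score of 1 means perfect overlap and 0 means no overlap, so a reported Dice score over 94% indicates that the predicted and reference masks overlap almost completely.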


VI. Conclusion 

Recent advances in ML can be used to solve a remarkable variety of problems, and DL is pushing the boundaries even further. 
Systems enabled by machine learning can provide customer service, manage logistics, analyze healthcare records, or even write 
news stories. Here we collected medical image analysis methods from the last three years, namely (1) annotation, 
(2) classification, (3) detection and localization, (4) segmentation, (5) registration, (6) regression, and (7) image 
reconstruction and postprocessing. All of the above-mentioned methods are very important and useful in the medical field and PR. 
The value potential is everywhere, even in industries that have been slow to digitize. These technologies could generate 
productivity gains and an improved quality of life, along with job losses and other disruptions. Previous research found that 
45 percent of work activities could potentially be automated by currently demonstrated technologies; machine learning can be an 
enabling technology for the automation of 80 percent of those activities. Breakthroughs in natural language processing could 
expand that impact even further. 
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